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Abstract 

In the bootstrap percolation model, sites in an L by L square are 
initially independently declared active with probability p. At each 
time step, an inactive site becomes active if at least two of its four 
neighbours are active. We study the behaviour as p —* and L — > 
oo simultaneously of the probability I(L,p) that the entire square is 
eventually active. We prove that I(L,p) — ► 1 if liminf plogL > A, 
and I(L,p) — > if limsup plogL < A, where A = 7r 2 /18. We prove 
the same behaviour, with the same threshold A, for the probability 
J(L,p) that a site is active by time L in the process on the infinite 
lattice. The same results hold for the so-called modified bootstrap 
percolation model, but with threshold A' = 7r 2 /6. The existence of 
the thresholds A, A' settles a conjecture of Aizenman and Lebowitz ||, 
while the determination of their values corrects numerical predictions 
of Adler, Stauffer and Aharony 0j. 

1 Introduction 

We consider the bootstrap percolation process in two dimensions. Let Z 2 = 
{x = (xijXz) : xi,X2 6 Z} be the set of all 2- vectors of integers. Elements of 
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Z 2 are called sites. The neighbourhood N(x) of a site is 

N(x) = {y G Z 2 : \\x - y\\ = 1}, 

where || • || is Euclidean distance. So |iV(x)| = 4 for all x. Let K be a subset 
of Z 2 . Define B(lT) by 

B(K) = K U {x G Z 2 : |JV(ar) n AT| > 2}, 

and (iC) by 

(A) = lim 

where £>' denotes the t-th iterate. (If K is the active set at time 0, then £>' 
is the active set at time t and (K) is the active set at time oo). 

Now fix p G [0, 1] and let X be a random subset of Z 2 in which each site is 
independently included with probability p. More formally, denote by P p the 
product probability measure with parameter p on the product a-algebra of 
{0, 1} Z , and define the random variable X by X{u) = {x G Z 2 : u(x) = 1} 
for u G {0, l} z2 . A site x G Z 2 is said to be occupied if x G X. 

We say that a set K C Z 2 is internally spanned if (X H Z£) = K. A 
rectangle is a set of sites of the form 

R(a, b; c, d) := {a, . . . , c} x {b, . . . , d}, 

where a < c, b < d are integers. We also write R(c, d) = R(l, 1; c, d). Define 
the function 

I(L,p) = P p (^R(L, L) is internally spanned^. 
Our main result is the following. 

Theorem 1 Let L n ,p n be sequences such that L n — ► oo and p n — > 0. Then: 

(i) z/liminf p n \ogL n > A i/ien lim I(L n ,p n ) = 1; 

n— »oo n— >oo 

(ii) «/ lim sup p„logL n < A i/ien lim I(L n ,p n ) = 0; 
u>/iere 

A = — . 
18 

The main step in the proof of Theorem || will be Theorem below, concerning 
the probability of much smaller squares being internally spanned. 
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Theorem 2 

(i) lim sup sup —p log I(m,p) < 2A. 

(ii) lim inf lim inf — p log I(\_B /p\,p) > 2 A . 

B^oo p— *0 

(Here |_ - J denotes the integer part). 

We also establish the following result about the time-evolution of the 
process on Z 2 . Define 

j(t,p) = P p ((o,o)e 

(the probability the origin is active by time t). 

Theorem 3 Let t n ,p n be sequences such that t n — » oo and p n — > 0. Then: 

(i) i/ lim inf p„logt„ > A then lim J(t n ,p n ) = 1; 

n— »oo ra— >oo 

(ii) z/ lim sup p n logt n < A then lim J(t n ,p n ) = 0. 

The modified bootstrap model is a variant of the above model, in which 
the definition of B is replaced with 

B'{K) = KU jx G Z 2 : |{x + e i; x- e^} n K| > 1 for each of % = 1,2}, 

where ei = (1,0) and e2 = (0,1). We define /'(•,•) and J'(-,-) accordingly. 
The arguments in this article can be used to prove the following. 

Theorem 4 For the modified bootstrap model, the analogues of Theorems 
UH 7 [1 hold, with A replaced by 

Our results answer the main question posed in 0] in the case of two- 
dimensional bootstrap percolation. In that paper is was proved (for a wider 
class of models) that for p — > and L — > oo, we have I(L,p) — > 1 if 
lim inf p log L > c\ and I(L,p) — >• if lim sup p log L < C2, for different 
constants Ci,C2, and similarly for J(L,p). Other aspects of the model have 
been subsequently studied in detail (see 0,(0 an< ^ ^ ne references therein, for 



3 



example), but the natural question of whether ci, c 2 could be replaced with a 
single sharp threshold remained open. Our results answer this affirmatively, 
as well as establishing the precise value of the threshold. 

Predictions for the thresholds A, A' based on simulation are not in good 
agreement with our rigorous result. In 0,[|, the estimates 0.245 ± 0.015 
for A and 0.47 ± 0.02 for A' are reported, whereas A = vr 2 /18 = 0.548311 • ■ ■ 
and A' = 7r 2 /6 = 1.644934- ■ •. The likely reason for this discrepancy is that 
it is necessary to take L extremely large in order to "see" the true limiting 
behaviour. (These simulations used values of L up to 28,800). Similar phe- 
nomena have been noted for several other variants of bootstrap percolation; 
for details see @ , @ , , , @ . 



Bootstrap percolation in three and higher dimensions presents a new 
set of challenges; for details see |5|] , II , [0] • ^ n particular for the three- 
dimensional model in which a site becomes active if at least three of its six 
neighbours are active, it was established in || that the form of the threshold 
regime is different: I(L,p) — ► 1 if lim inf p log log L > C\ and I(L,p) — ► if 
lim sup p log log L < C2~, the extension to other dimensions is treated in ||. 
The arguments required for these results are more sophisticated than those 
in ||, and it seems that the problem of establishing a sharp threshold here 
is likely to be correspondingly harder. 

Aside from their intrinsic mathematical interest, bootstrap percolation 
models find numerous applications, both directly and as tools in the analysis 
of more complicated systems. See for example PJ , J7| , |8]] , ]T2| , [[H| . 

In H it was also conjectured that stochastic Ising models show similar 
metastability behaviour. The analogue of the result in |§ was proved (in 
arbitrary dimension) in fT5| , and the analogue of our result was proved in 

We omit the proof of Theorem ^ regarding the modified bootstrap model. 
The proof is almost identical to those given for the earlier theorems, and in 
fact a few simplifications are possible. The main difference is that the defini- 
tion of a rectangle being "horizontally (respectively vertically) traversable" 
in Section |3| should be replaced with the statement that all the columns (re- 
spectively rows) are occupied, and the function g (see Sections 0, |3|, |]) should 
be replaced with /. 

The article is organized as follows. In Section |2] we introduce and solve a 
definite integral which gives rise to the constant A = 7r 2 /18. In Section |3| we 
give notation and basic results. In Section |] we prove Theorem ^| (i), and in 
Section |^ we deduce Theorems [l] and ^| from Theorem |2|. These proofs are 
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relatively standard; the rest of the article is devoted to the much harder task 
of proving Theorem ^| (ii). In Sections || [7], |8] and || we assemble the tools 
needed for this; the reader may prefer to skip the proofs in these sections on 
a first reading. Finally in Section [10] we complete the proof of Theorem 
(ii). 

We now present a sketch of the ideas behind the proofs in this article. 
Theorem p] may be deduced from Theorem |2] as follows. Roughly speaking, 
Theorem |2] states that if we take B large and then let p — ► 0, then I(B/p,p) 
behaves roughly as e _2As//p , where \b — > A as B — > oo. Now suppose L « e c / p 
for some c. It may be shown that the event that R(L, L) is internally spanned 
is roughly equivalent to the event that it contains some internally spanned 
square of side length B/p. This is because for B large, most intervals of 
length B/p contain an occupied site, so when a growing active cluster reaches 
approximately this size it is almost certain to grow to fill the whole square 
(such a growing cluster is sometimes referred to as a "critical droplet" in the 
literature). The expected number of internally spanned squares of side B/p 
in R(L,L) is approximately L 2 e~ 2As/:P « e 2(c ~ As)/p , which is > 1 if c > X B 
and <C 1 if c < A#. However, we may choose B so that A^ is as close to A as 
desired. A more careful application of these ideas proves Theorem [I], and a 
similar approach gives Theorem [3]. 

We now turn to the proof of Theorem The lower bound on / in (i) is 
relatively straightforward; the upper bound in (ii) is much harder. For (i), 
assume m = B/p and consider one way in which R(m,m) can be internally 
spanned. Suppose that (1, 1) is occupied, and that for each k = 2, . . . , m, 
the column with horizontal coordinate k and height k — 1 contains at least 
one occupied site, and similarly the row with vertical coordinate k and width 
k — 1. Then it is easily seen that each square with bottom- left corner (1,1) 
is internally spanned, so the active set grows to fill whole of R(m,m). This 
picture gives the correct lower bound for the modified bootstrap model, but 
for the bootstrap model one can do better. Since a site becomes active if it 
has active neighbours on two opposite sides, we can allow some of the rows 
and columns described above to be vacant. Roughly speaking, the growth 
will still take place provided there are never two adjacent vacant rows or 
columns (actually, a little more care is needed, see Section |J). A calculation 
shows that the probability of this event behaves roughly as e~^ 2//p ^o 9 w here 
g is a certain function which satisfies J °° g(z) dz = A = 7r 2 /18. This proves 
Theorem |2| (i), and also shows that in some sense the "natural" length unit 
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for the problem is 1/p. 

For the upper bound, Theorem || (ii), the basic idea is to consider all 
other possible ways in which R = R(B/p, B/p) could be internally spanned, 
and find upper bounds on the number of ways and the probability of each 
one. One way in which R could be internally spanned is for the growth 
described above to start from the center of the square R (say), and for an 
active square to grow in all four directions rather than just two. It turns out 
that the probability of this is essentially the same, e~( 2 ^ ^ 9 , roughly because 
growth by one unit each to the left and to the right is equivalent to growth 
by two units to the right. Hence there is in some sense no loss of generality 
in assuming that growth starts from the corner. However, there are many 
other ways in which this growth could occur. For example two adjacent rows 
as described above might be vacant, in which case the active region could 
still continue to grow horizontally for a while, until it encounters an occupied 
site in the vertical direction, at which point vertical growth can resume. It 
can be shown that such a possibility has much smaller probability than the 
one considered earlier. Indeed, each such growth history corresponds to an 
oriented path 7 in [0, oo) 2 (where the length is rescaled by a factor and 
it may be shown that the probability of such a history is roughly e~ w<a ^ p 
where w is a certain functional on paths defined as a path integral involving 
the function g. By the convexity of g, it can be shown (see Section |6|) that 
this functional is minimized along the main diagonal X\ = X2, with minimum 
2 f Q g, which corresponds to growth as a square as described above. 

At this point it is natural to try to get an upper bound on the probability 
of R being internally spanned by summing the above probability over all 
possible paths. However, the number of such paths is large, roughly 2 S//p , 
so this does not give the correct bound. The solution is to introduce a 
new "coarse- graining" length scale T/p, which is small compared with B/p 
but large compared with the lattice spacing 1. It may be shown that the 
probability of seeing some path which coincides with 7 at a set of points 
of spacing T is also approximately e~ w<a ^ p . This is because all such paths 
require roughly the same occupied sites. Actually, for this argument to work, 
we must assume that the growth starts not from the point (1,1) but from 
a larger rectangle, of size A/p, where T <C A < 1 < 5; the details are 
in Section [7]. The number of possible choices of a set of points at spacing 
T/p is much less than before (in fact, bounded in p), so now summing over 
all possibilities gives an upper bound of approximately e~^ 2 ^ p ^A 9 (and the 
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integral here is close to J °° g for A small and B large, since the latter integral 
converges) . 

However, there is another difficulty. There are other ways for R to be 
internally spanned which do not involve only growth starting from a single 
"seed". For example, the growing square described above might encounter 
another internally spanned rectangle S within R, and the two would combine 
to give a larger internally spanned rectangle without any of the intervening 
growth taking place. It seems unlikely that such an event could have higher 
probability, since the probability of a small internally spanned rectangle S 
should be much less than the probability of finding at least one occupied 
site in each of the corresponding rows and columns required for growth as 
described previously. In fact, most of the work in the proof is to rule out 
possibilities such as this. 

The idea is as follows. Suppose R is internally spanned. Then we expect 
to be able to find a sequence of successively slightly smaller internally spanned 
rectangles inside R (corresponding to the growth picture above). However, 
at some point there may be a "split" into two separate internally spanned 
rectangles. Now we apply the same reasoning to each of these, and so on. 
In this way we obtain a "hierarchy" - a tree structure of nested rectangles 
each of which is internally spanned. We can arrange that when there is 
no splitting, the sizes of consecutive rectangles differ by approximately the 
coarse-graining scale T/p; when there is a split, the two offspring rectangles 
might be much smaller than the parent, but they should have the property 
that they "span" the parent, in the sense that bootstrap percolation starting 
with the two offspring rectangles completely filled results in the parent being 
completely filled. Also, for the coarse-graining argument to work, we should 
stop whenever the rectangles in a line of descent become too small (perhaps 
smaller than A/p), and declare such a rectangle to be a seed. 

Now, another difficulty arises. Since there may be many splits, there 
may also be many seeds; indeed R could be "almost all seeds" in which case 
there is no room left for the growth estimates used previously. The solution 
is to use the fact that seeds are small, together with an a priori bound on 
the probability a small rectangle is internally spanned, to show that there 
are "not too many" seeds. More precisely, we introduce yet another length 
Z/p <C A/p, and declare a rectangle a seed if its size is less than Z/p (the 
exact meaning of "size" turns out to be a delicate issue here). We show 
that the probability that the total perimeter of all the seeds is more than 
A/p is very small. Then, provided the total perimeter is less than A/p, a 
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geometrical argument together with a variational principle can be used to 
show that the probability of the whole hierarchy is approximately at most 
e -(2/p)/ A g ^he order of choosing the constants is important here: they must 
be chosen in the order B, A, Z, T, then finally p — > 0. The total number of 
hierarchies is large, but it can be shown to be at most p~ K where K depends 
on B but not p. Hence summing gives a bound of the correct form. 

At the heart of any such proof must be a (deterministic) result giving 
rigorous necessary conditions for a rectangle to be internally spanned. In 
Pfl, this role was played by a result which we shall also make use of, Lemma 
p~0| . For the construction of hierarchies, the required condition is provided 
by an apparently new result, Proposition |3l], which roughly speaking states 
that any internally spanned rectangle must be spanned by a pair of smaller 
rectangles which are themselves internally spanned disjointly (in the sense of 
the Van den Berg-Kesten inequality). 

2 Integrals 

We define the functions / : (0, oo) — > (0,oo), ft : (0,1) — > (0,1), and g : 
(0,oo)^(0,oo) by 

/(^) = -log(l-e- 2 ), 

u+ ^u(A - 3u) 
Pw = , 

g(z) = - log (3(1 -e~ z ). 

Note that / and g are continuously differentiable, positive, decreasing and 
convex. To see the latter for g note that the function —z 1— »■ g(z) is the 
composition of the increasing, convex functions z i— > e z , z > —ft(l — z) and 
z 1 — > — log(— z). Observe also that ft(u) > u so f(z) > g(z) for z G (0,oo). 
Finally note that f(z),g(z) — > 00 as z — » 0, and f(z),g(z),zf(z),zg(z) — > 
as z — > 00. 

Proposition 5 

(i) 

f°° 7T 2 

I f ^ dz = r> 
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(ii) 

f°° 7T 2 

Jo 9{z)dz= 18- 



We define A to equal the second integral. 

Proposition || (ii) is somewhat remarkable; the integral does not appear in 
standard tables such as @, and seems not to be directly solvable by computer 
programs such as Mathematica. The proof which we shall give is mysterious, 
and uses many special features of the function g. 

Proof of Proposition |5|. In (i) we substitute x = e' z to obtain 
-log(l-e _ *)d«= / -log(l-a;)— = 

JO 2 

by H, number 4.291.2. 

The integral in (ii) converges because g < f. Substituting y = e~ z and 
then taking out a factor 1 — y, we have 

i 

g(z) dz 



3-« 



. l-e~ z + v/(l-e- 2 )(l + 3t , , 
log dz 



, y/(l-y)(l + 3y) dy 
- log — 



/ -log(l-y) ^+ / -log 

Jo y Jo 



1 1 _l / l+3y 

, r , V x -y d v 



y 



Note that the first of these two integrals converges by the above, hence the 
second must also. 

In the first integral we make the substitution x = 1 — y. In the second 
we make the substitution 



x 



which yields 

1 — x 

y = ~\ — t — 2 

1 — X + x z 
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and 



dy 



(2 — x)x 



-dx, 



y (x — 1)(1 — x + x 2 ) 
and also interchanges the limits and 1. Thus we obtain 



g(z) dz 



1 , dx 
— logx 







logx 



logx 



1 — X 

1 



(2 — x)x 

7 TTn T~*\ 

o (x — — X + x z ) 



dx 



(2 — x)x 



1 — x (x — 1)(1 — X + X 2 ) 
2x - 1 



dx 



1 — x + x" 1 



dx 



logx log(l — x + x 2 



log(l — x + x ) — (by parts) 

JO J X 



f\ fl + x 3 \ dx 

°-y. iog (—) t 



log(l + x) / log(l + x 6 ) — 

o x Jo x 



dx 



f 1 dx f 1 du 

I log(l + x) / log(l + u) — (substituting u = x 3 

Jo x J 3u 

-J log(l + x)- 





\ \ (by 1, 4.291.1) 

iL 

18' 



□ 



3 Basic Notation and Results 

In this section we introduce basic tools which will be used throughout. 
The following elementary properties of (■) will be fundamental: (K) D K; 
((K)) = (K); if K C K' then (K) C (K'). These have the important 
consequence that if K C K' C (K) then (K') = (if). 
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Let R = R(a, b; c, d) be a rectangle. By the dimensions of R we mean the 
2-vector 

dim(H) = (c - a + 1, d - b + 1). 

If dim(i?) = (m,n) we define the s/jori side short (i?) = min{m, n}, the long 
side long(i2) = max{m, n} and the semi-perimeter <p{R) = m + n of R. 

A site i G Z 2 is occupied if i 6 X. A set of sites if C Z 2 is /u/Z if every 
site in K is occupied, and occupied if at least one site in K is occupied. It 
will be convenient to write 

q = -log(l -p). 

Note that q > p and g ~ p (that is, g/p — > 1) as p — > 0. The advantage of 
this notation is that 

P P {K is occupied) = 1 - e~ lKlq = e ~ fmq) (6) 

(where / was defined in Section Q). 

We say that a sequence of events (Aj) has a double gap if there is an 
adjacent pair A iy A i+ i neither of which occurs. 

Lemma 7 In a sequence of k independent events each with probability u G 
(0, 1), the probability afc(u) that there are no double gaps satisfies 

(3(u) k < a k (u) < Pinf- 1 

where 

u+ a/u(4 - 3m) 
= g ' 

Proof. By induction on k, on noting that a = a% = 1, a k+2 = uak+i + 
(1 - u)ua k , < (3 < 1, and /? 2 = it/3 + (1 - u)u. □ 

Let R = R(a, b; c, d). For a < i < c (respectively 6 < j < d), column i (re- 
spectively row j) of R is the rectangle R(i,b;i,d) (respectively R(a, j; c, j)). 
We say that R is horizontally (respectively vertically) traversable if the se- 
quence (column i is occupied)£ =a (respectively (row j is occupied)^ =fe ) has no 
double gaps. The following definitions will also be convenient. The rectangle 
R is East-traversable (respectively North-traversable) if it is horizontally (re- 
spectively vertically) traversable and in addition column c (respectively row 
d) is occupied. 

Recall the definitions of / and g from Section f|. 
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Lemma 8 If R is a rectangle with dimensions (m,n) then 



(i) 

e - mg (ng) < p ^ R is horizonta ii y traversable) < e -^-i)g(.ng) ,. 

(ii) 

e - mf (n q ) < e -( m -i) 9 (n g )-f(n q ) < ^ East-traversable) < e - m9(nq) . 

And similar inequalities hold for vertical and North- travers ability, with m 
and n exchanged. 

Proof. Part (i) follows immediately from (|5[), Lemma [F]and the definition 
of g. For the upper bound in (ii), note that if R(a, b; c, d) is East-traversable 
then R(a,b;c+ l,d) is horizontally traversable, and use (i). The second 
inequality in (ii) is straightforward, and the first follows because / > g. □ 

To see the usefulness of the above concepts note the following. 
Lemma 9 

(i) If R is internally spanned then R is East- and North- traversable. 

(ii) If Ri = (a, 6; c, d) is internally spanned and R2 = (c + 1, b; e, d) 
is East-traversable then R\ U R2 is internally spanned. And a similar 
statement holds for North-travers ability. 

Proof. For part (i), note that if two adjacent columns or the East-most 
column of R contains no occupied sites, then no site in these columns can be 
in (R H X). A similar remark applies to rows. For (ii), it is easy to see that 
each successive column in R 2 (moving in the East direction) is in (R n X). 
□ 

An event A of {0, l} z2 is called increasing if whenever u e A and u' > 10 
we have uj' G A. The FKG inequality states that for increasing events A, B 
we have P P (A n B) > P P (A)P P (B) (see Q p34 for example). 
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Figure 1: An illustration of the event A. The arrows indicate East- and 
North- traversability. 

4 Lower Bound 

Proof of Theorem [2] (i). Let r = Lj>~ 1/2 J , and let A = A(m,p) be the 
event that all the following occur: 

R(l, 1; 1, r) and R(l, 1; r, 1) are full, 

the sites (m, 1) and (l,m) are occupied, 
and for all integers k > 1: 

i2(Ar + 1, 1; At + r, At) is East-traversable, 

At + 1; At, At + r) is North-traversable. 

See Figure 0. 

Using Lemma |9| (ii), it is easily seen that if A occurs then R(m,m) is 
internally spanned, hence using Lemma |8| (ii) and the FKG inequality, 

oo 2 

I(m,p) > P P {A) > p 2r+1 Y[ (V (r - 1)9(fcr9) - /(fcr(j) ) , 

k=l 
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for all m. Hence, 

sup — p log I(m,p) 

m>l 

2(r — l)p °° 2p 00 
< — (2r + l)p\ogp H g(krq)rq H f(krq)rq 

r 1 k=i r 1 k=l 

f°° 2p f°° 

/ g(z) dzA / f(z) dz 

Jo rq J 



2p 

< — 3rplogp-\ 

Q 



where in the last step we have use the fact that / and g are decreasing. Now, 
recalling that q ~ p, r = |_j>~ 1//2 J , and J °° f(z) dz < 00, we see that as p — > 
the above expression converges to + 2 J °° g(z) dz + = 2A, as required. □ 



5 Metastability 

In this section we deduce the (i) parts of Theorems [I] and ^ from the (i) part 
of Theorem El and similarly for the (ii) parts. 

Proof of Theorem [I] (1). It is clearly sufficient to prove that for 
any e > 0, if p n — > and L n — » 00 are such that p n logL n > A + e then 
I(L n ,p n ) -> 1. 

Suppose plogL > A + e and p < 1/2. Let m = [p~ 3 \, and let S = 
S(L,p) be the event that R(L,L) contains at least one internally spanned 
rectangle of dimensions (m, m). By dividing R(L,L) into disjoint rectangles 
of dimensions (m, m) we see that 



p P (S) >i-(i-/M 



[L/mf 



SO 



-log(l- P p (S))>^I(m,p). 



Therefore 



plog 



— log(l — P P (S)) > 2p\ogL + 6p\ogp — plog2 + p\ogI(m,p). 



Hence, by Theorem (i) we have 



lim inf p n log 



- log(l - P Pn (S n )) > 2(A + e) + - - 2A = 2e, 
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where S n = S(L n ,p n ), so in particular P Pn (S n ) — > 1. 

The following is proved in [pj. There exists a sequence of increasing events 
H n defined in terms of the states of sites in R(L n , L n ) such that P Pn (H n ) — ► 1, 
and if and if n occur then R(L n , L n ) is internally spanned. This proves 
the result, since by the FKG inequality, 

I(L n ,p n ) > P Pn (S n )P Pn (H n ) ^ 1. 

Here is a sketch of the construction of H n . If p n logL n — > a > A then 
we may take if n to be the event that every rectangle of dimensions (m, 1) or 
(l,m) in R(L n ,L n ) is occupied. If L n grows faster than this then we must 
also use a renormalization argument, dividing R(L n , L n ) into disjoint squares 
of size e( x+s ^ p ", so that the probability each one is internally spanned exceeds 
the critical probability for site percolation on Z 2 . □ 

Proof of Theorem |3| (i). Note that, in contrast with J, J(t,p) is clearly 
increasing in t, so we may assume that p n \ogt n — > A + e, where e > 0. Let 
m n = \j>~ 3 \ and L n = \t n J (3m n )J; then p n logL n — » A + e. It is easily seen 
that the events H n in the proof of Theorem [l] (i) above may be chosen in 
such a way that if S n and H n occur then the whole of R(L n , L n ) becomes 
active in time at most 

\R(m n ,m n ) \ +2m n L n < t n 
(The inequality holds if p n is sufficiently small). Hence, 

P n ((l, 1) G &"(X n i?(L n , L n ))) - 1, 
therefore J(t n ,p n ) — ► 1. □ 
To prove Theorem ^ (ii) we need the following result from || . 

Lemma 10 Let k be a positive integer and let R be a rectangle with long(R) 
> 2k + 1. If R is internally spanned then there exists an internally spanned 
rectangle T C R with long(T) G [k, 2k + 1]. 



The idea of the proof of Lemma 10 is to construct the bootstrap percola- 
tion process by an algorithm which sequentially replaces a pair of internally 
spanned rectangles with a larger internally spanned rectangle; the result then 
follows because at each step the long side of the longest rectangle increases at 
most as n i— > 2n+ 1. For the details see [EL We shall use a similar argument 



in Section M to prove Proposition 31 
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Figure 2: The rectangles T,S X ,S2- The arrows indicate North- and East- 
traversability. 

Proof of Theorem [T] (ii). It is clearly sufficient to show that for any 
e > 0, if p n — > and L n — > oo with p n \ogL n < A — e then I(L n ,p n ) — > 0. 
We write A# = \ v^-b'>b liminf p ^ — plog/( [B'/p\ ,p), so that Theorem g 
(ii) states lims^oo > A. 

Fix e > and B > 0, and suppose that plogL < A — e. We write R = 
R(L, L) and let K = [B/p\ and k = [B/(2p)\ - 1, so that k < 2k + 1 < K. 
Assume that p is sufficiently small that 1 < k < K < L. We claim first that 
if T is any rectangle satisfying long(T) e [k, 2k + 1] then 

P P {T is internally spanned) < e 2Kf{kq) I(K,p). (11) 

To prove this, suppose without loss of generality that T is of the form T = 
R(a, b) where a e [A;, 2fc + 1] and b < K. Let S x = R(l, b+l;a, K) and S 2 = 
R(a + 1, 1; K, K)\ see Figure 0. By Lemma |9] (ii), if T is internally spanned, 
and 5*i is North-traversable, and S2 is East-traversable, then R(K, K) is 
internally spanned. Hence by Lemma (ii) we have 

I{K,p) > P P (T is internally spanned)^^^^-^^ 

which yields (O) since / is decreasing. 

Hence using Lemma ^ and (0) we have 

I(L,p) <L 2 K 2 e 2Kf ^I(K,p). 

Here L 2 K 2 is an upper bound on the number of possible choices for the 
rectangle T. Hence 

p log I (L,p) < 2plogL + 2plogK + 2Kpf(kq) + p log I(K,p). 
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Therefore, recalling that p n \ogL n < A — e, and the definitions of k,K,X B , 
we have 

limsupp* log < 2(A - e) + + 25/(5/2) - 2X B . 

Since lim.B^oo Bf(B/2) = and lirriB_ ) . 00 A£ > A, we may choose B suffi- 
ciently large (depending on e) that the above expression is negative, so in 
particular I(L n ,p n ) — >• 0. □ 

Proof of Theorem [3] (n). In consequence of the results (5. 4), (5. 5) of 
0, if I(L,p) — > then J(L,p) — > 0. Hence the result follows from Theorem 

m(h). □ 



6 Variational Principles 

We write = (0, oo) 2 = {a = (ai,a 2 ) : a 1; a 2 G (0, oo)} for the set of 
all 2-vectors of positive reals. Boldface letters will denote elements of M 2 ^ 
unless stated otherwise. We write a < b if &i < bi and a 2 < b 2 . Let g 
be a continuously differentiable, positive, decreasing, convex function from 
(0, oo) to (0, oo) (g is otherwise arbitrary for the purposes of this section). A 
(piecewise-linear, oriented) path 7 from a to b is a subset of consisting of 
the union of a finite sequence of points a = u° < u 1 < • ■ • < u k = b (called 
the vertices of the path) together with the line segments {atf + (1 — a)u t+1 : 
a G (0, 1)} joining the vertices in order. A path 7 may be parameterized as 

7 = {(x(t), y(t j) : t G [a, b]} where x(t) + y(t) = t. We define the functional 
w(j) as the path integral 



w (i) = J (g{y)dx + g{x)dy s j = J \g(y)-^ + 9i x )-^:J dt , 

and for a < b we define 

W(a, b) = inf 10(7) 

7:a— >b 

where inf 7:a ^b denotes the infimum over all paths 7 from a to b. 

The purpose of this self-contained section is to prove the following four 
properties of W. Propositions [12], |1^ and |14] are fairly natural; Proposition 



15 is tailored to a specific application in Section 10 
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Proposition 12 // a < b < c then W(a, b) + W(b, c) > W(a, c) . 
Proposition 13 ^(a, b) < (bi — ai)g(a 2 ) + (b 2 — a 2 )g(ai). 
Proposition 14 // ai + a 2 = A and b = (B, B) where A < B then 

W(a,b) > 2 f g{z) dz. 

J A 

Proposition 15 Suppose that a < b; c < d; r > b; r > d; r < b + d+(g, q) 
and r > (2Z, 2Z) where q < Z. Then there exists s satisfying s < r and 
s < a + c such that 

W(a, b) + W(c, d) > W(s, r) - 2qg(Z). 

Indeed we may take s = aV[(a + c)A(a + c + r — b — d)]. 

In the above, V and A denote coordinate-wise maximum and minimum re- 
spectively. 

Proof of Proposition [12|. We have 
W(a, c) = inf 10(7) 

< inf ^(7) 

7:a^b^c 

= inf w (71) + inf 10(72) 

71 :a— >b 72 :b— >c 

= W(a,b) + W(b,c), 

where inf 7:a _ + b^c denotes the infimum of all paths from a to c containing b. 
□ 



Proof of Proposition 13. For any path 7 from a to c, since g is 



decreasing and u > a for all u e 7, we have 

= j(g{y)dx + g(x)dyj 

< 



7 



J (g{a 2 )dx + g(a 1 )dy^ 
(bi - a l )g(a 2 ) + (b 2 - a 2 )g(a l ). 



□ 
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Figure 3: The regions U and V. 



In order to prove Proposition [14| we need Lemma [16| below. For sets 
A,BC we say A lies Northwest of B and write A y B if for any a G A 
and b G B satisfying a! + a 2 = bi + b 2 , we have a 2 > b 2 . Let A be the 
"main diagonal" of M. 2 ,: 



A := {u G M+ : Ux = u 2 }. 



Lemma 16 1/71,72 ore pai/is from a to b ; and either 71 ^ 72 ^ A or 
A >: 7 2 y ji, then 10(71) > 10(72)- 

Proof. Without loss of generality (since the definition of w is symmetric 
in the two coordinates), we may assume 71 y 7 2 y A. 

First note that for any path 7 from a to b, we may express 10(7) as 



w{l) = JJ g'{y) dxdy+ (bi -ai)p(a 2 ) + // ^(ar) rfx rfy + (b 2 - a 2 )#(ai) 

(17) 

where the regions U, V are defined by 

U = {u : a < u < b and 7 y {u}}, 
V = {u : a < u < b and {u} y 7}; 

see Figure ^. To check flT7D note that by performing the y integral we have 

JJ g'{y) dx dy = J (g{y) - s-(a 2 )) dx = J g{y) dx - (b x - ai)3>(a 2 ), 
and similarly for the second integral. 
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Figure 4: A path 7 from a to b, and the optimizing path 70. 



Applying ( pf ) to 71 and 72 and subtracting, we have 

10(71) - 10(72) = j/jf (#'(i/) - g'{xfj dx dy, 

where if is the region between 71 and 72 : 

if = {u:a<u<b and 71 ^ {u} >: 7 2 }. 

Since 71 >z 72 b A, we have H >z A, and hence 1/ > 2 for (2,1/) G if, and 
since (7 is convex this implies that g'(y)—g'(x) > on if, so 10(71)— 10(72) > 0. 
□ 



Proof of Proposition [L4]. Let 7 be a path from a to b. We claim that 

> w(7o) 

where 70 is the path with vertices a, u, b, where Ui = u 2 = max{ai,a 2 } 
(see Figure |j). Thus W(a, b) = 10(70). To check the above claim, split 7 
into sections separated by the intersections of 7 and 70, and observe that 
by Lemma [16|, each section of 7 has a value of w at least as large as the 
corresponding section of 70 . 

Now since g is positive, 10(70) > w(7o) where 7 is the straight path with 
vertices u and b. From the definition of w we have 

r-B pB 

w (7o) = 2 / g(z) dz> g(z) dz. 

□ 
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To prove Proposition 15 we need Lemmas |H| below. 
Lemma 18 // a < b and k e [0, oo) 2 then W(a, b) > W(a + k, b + k) . 
(Note that here k might not be an element of 
Proof. We have 

ty(a + k,b + k) = inf w(j) 

7:a+k^b+k 

= inf iy(7 + k) 

■y:a— >b 

< inf w( , y) since g is decreasing 
= W(a,b), 

where 7 + k denotes the shifted path obtained by adding k to each point in 
7. □ 

Lemma 19 // a < b < c then W(a, c) > ^(b, c) . 



We prove Lemma [L9] via the following. 



Lemma 20 If a < c and either b = (a 1? c 2 ) or b = (c 1; a 2 ) then W(a, c) > 
W{b, c). 

Proof. Without loss of generality suppose that b = (ai, c 2 ). Let 7 be any 
path from a to b, and let 8 be the unique (straight, horizontal) path from b 
to c. Since g is decreasing and positive we have 



W(b,c) = w(5) = / g(y)dx < / g(y)dx < w(j). 

J 8 J-y 



□ 



Proof of Lemma [19]. Let 7 be any path from a to b. Without loss of 
generality suppose that {b} y 7. Let u = (a 1; b 2 ) and v = (v 1; b 2 ) where 

Vl = inf{t : (t,b 2 ) E 7}. 
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71 



Figure 5: Illustration of the proof of Lemma [19 



Write 7 = 71 U 72 where 71 is a path a to v and 72 is a path v to b (both 
of which are thus uniquely defined). Let 5 = Si U 62 where 5\ is the unique 
path from u to b and 62 is the unique path from b to v. See Figure [|. 
Now 



w(j) = w(7i)+u>(7 



2 J 



> w(8) + 10(72) by Lemma |20 

> w(5 2 ) + 10(72) 
= w(J 2 U 72) 

> W^b, c) since 62 U 72 is a path from b to c. 

□ 



We are now ready to prove Proposition |T^; we start with a version without 
the "error term" . 

Lemma 21 Suppose that a < b; c < d; r > b; r > d and r < b + d. Then 
there exists s satisfying s < r and s < a + c such that 

W(a, b) + W(c, d) > W(s,r). 

Indeed we may take s = aV(a — b + c — d + r). 

Proof. Define s as indicated, and note that s > a; s < a + c; and s < r. 
Then we have 

W(a,b) + W(c,d) 

> W(s, s — a + b) + W(r + c — d, r) by Lemma [18| twice 

> W(s, s — a + b) + W(s — a + b, r) by Lemma [19 

> W(s, r) by Lemma [l^. 
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Figure 6: The rectangles Ri, . . . , R$. 



In the first inequality we have used the facts that s — a > (0,0) and r — d > 
(0, 0). For the second (and third) we must check that r+c — d < s — a+b < r, 
which is achieved as follows: 



r - (s - a + b) = (r + a - b) - s = (r - b) A (d - c) > (0, 0), 



and 



(s-a+b)-(r + c-d) = s- (a-b+c-d + r) = (0, 0) V(b-c+d-r) > (0,0). 

□ 



Proof of Proposition [15]. Let r' = r A (b + d), and note that r' > b; 
r'>d; r'<b + d; r' < r and r'>r- (q, q) > (2Z - q, 2Z - q) > (Z, Z). 
Hence we may apply Lemma |2l] to obtain 

W(a, b) + W(c, d) > W(s,r') 

> W(s, r) — W(r', r) by Proposition [12 



> W(s, r) — 2qg(Z) by Proposition |T^ , 

where s = a V (a — b + c — d + r') = a V [(a + c) A (a + c + r — b — d)], and 
s < r' < r and s < a + c. □ 



7 Border Events 

Let R, R! be two rectangles satisfying R C R'. Define rectangles R\, . . . , Rs 
(some of which may be empty) according to Figure^, so that R' is the disjoint 
union of R, R\, . . . , R$. Define D(R, R') to be the event that each of the two 
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rectangles R\ U R$ U R7 and R 3 U _R 4 U i? 5 is horizontally traversable, and each 
of the two rectangles Ri U R 2 U -R3 and _R 7 U -R 6 U -R 5 is vertically traversable. 
One may think of D(R, R') as a necessary condition for the "border" between 
R and R 1 to be "traversable" from R to R' . More precisely, the event has 
the following properties. 

(i) If R' is internally spanned then D(R, R') occurs. 

(ii) D(R, R') is defined in terms of the states of sites in R' \ R. 

Property (i) holds because if R' is internally spanned then it must be hor- 
izontally and vertically traversable. The purpose of this section is to prove 
the following upper bound on the probability of D(R, R'). 

Proposition 22 For any Z > and c G (0, 1/2), there exist Q = Q(c, Z) < 
00 and T = T(c,Z) G (0, Z/2) such that for any rectangles R C R' with 
dimensions (m, n) and (m + s,n + 1) respectively, and any q > 0, provided 
m,n > Z/q and s,t < T/q we have 

P P {D{R, R')) < Q exp ( - (1 - 2c) [g{nq)s + g(mq)t] ) . 

In the applications of Proposition |2^, it will be essential that Q,T do not 
depend on q. 

Proof. Let H be the "corner region" , H = R 1 U R 3 U R 5 U R 7 , and let Y be 
the number of occupied sites in H, Y — \X n H\. The idea of the proof is as 
follows: if the four events in the definition of D were independent, the proof 
would be easy. If Y is small, then the events are nearly independent, and if 
s, t are sufficiently small compared with m, n, the probability Y is large can 
be made smaller than the bound we are trying to obtain for P p (D). 
Without loss of generality suppose that s < t. We have 

P(D) 

= P(D I Y < cs)P(Y < cs) + P(D I cs < Y < ct)P(cs <Y <ct) 

+P(D I Y > ct)P(Y > ct) 
< P(D I Y < cs) + P(D I cs < Y < ct)P(Y > cs) + P(Y > ct) (23) 
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We claim that the terms appearing on the right side of (^) may be bounded 
as follows 



P(D | Y < cs) < e ig{z) exp -(1 - 2c) [g(nq)s + g(mq)t] (24) 

P(D \cs <Y <ct) < e 2g(z) exp -(1 - 2c)g(mq)t (25) 

P{Y > cs) < exp-cs(logc-logT- 1) (26) 

P(Y > ct) < exp-ct(logc- logT - 1) (27) 



for a suitable choice of T. 

The inequality (|26|) is an instance of the Chernoff bound, as follows. Y 
is a binomial random variable with parameters st and p, so 



provided tp < tq < T. (T will be a function of Z and c to be chosen later). 
The bound (^7j) follows similarly. 

To prove fl25|), we condition further on which sites in H are occupied. If 
for each choice of this set of sites (satisfying cs < Y < ct) the conditional 
probability of D is bounded above by the right side of (0), the desired bound 
will follow. For simplicity, suppose first that Y = ct, and also that the ct 
occupied sites in H all lie in different horizontal rows. These sites (together 
with R) split i?2 U -^6 i^o ct + 2 "horizontal strips" , some of which may be 
empty. (More precisely, if we remove from R 2 U Rq every row which contains 
an occupied site in H, we are left with ct + 2 rectangles of width m, some of 
which may be empty). In order for D to occur, a necessary condition is that 
each of these strips is vertically traversable. By Lemma |8| (i), the probability 
of this is at most 

exp — [t — ct — (ct + 2)]g(mq) 

(since the sum of the vertical heights of the strips is t — ct, hence the sum of 
their heights minus one is t — ct — (ct + 2)). Provided mq > Z, the above 
expression is at most 



P(Y > cs) 



< 



< 



< 



P(e a(y - cs) > 1) for any a > 
E(e a{Y - cs) ) 

exp[— csa + st log(l — p + pe a )] 
exp[— csa + stpe a ] 

exp[— cs log(c/ (tp)) + cs] taking e a = cj (tp) 
exp — cs(logc — logT — 1), 



e 2g{z) exp-(l - 2c)g(mq)t 



(28) 
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since g is decreasing. Now, if Y < ct, or if some of the occupied sites in H 
lie in the same horizontal rows, clearly the conditional probability of D will 
be even smaller, hence we have proved (|25|). 

The bound ([24]) is proved similarly. If Y < cs then Y < ct also, and by 
conditioning on the occupied sites in H we obtain a collection of horizontal 
strips in R 2 U Rq together with a collection of vertical strips in R^U R 8 . 
For D to occur, each of horizontal strips must be vertically traversable, and 
each of vertical strips must be horizontally traversable, but these two events 
are independent (conditional on the set of occupied sites in H), so the two 
bounds corresponding to (|28| ) are multiplied, to obtain the right side of ([24]). 

Now choose T > sufficiently small that 

c(logc - logT - 1) > 2(1 - 2c)g(Z) 
(and also T < Z/2). Since g is decreasing this ensures that 

exp — os(logc — logT — 1) < exp —(1 — 2c)g{nq)s (29) 

and also 



exp — ct(logc — logT — 1) < exp —(1 — 2c)[g{nq)s + g(mq)t). (30) 

In the latter we have used the fact that s < t so 2g(Z)t > g{Z)s + g{Z)t. 
Now substituting (g§-(0) into (g|) and using we obtain 

P(D) < 3e 49(z) exp-(l - 2c) [g{nq)s + g(mq)t). 

□ 



8 Disjoint Spanning 

For a collection of increasing events A\, . . . , A\. on {0, l} z2 , the event A\ o 
• • • o Ak that A\, . . . , Ak occur disjointly is defined as the event that there 
exist pairwise disjoint full sets of sites Kx, . . . such that for each i, Ai 
occurs whenever Ki is full. The BK inequality states that if A x , . . . , A^ are 
defined in terms the states of a finite set of sites then 

P p (A 1 o...oA k )<P p (A 1 )---P p (A k ). 



For more details see for example 11 p37. 
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Figure 7: In this example the rectangles R', R" must overlap. 

Proposition 31 Let R be a rectangle with \R\ > 2. If R is internally 
spanned then there exist two distinct non-empty rectangles R', R" such that 

(i) the strict inclusions R' C R, R" C R hold, 

(ii) (R U R") = R, 

(iii) {R' is internally spanned} o {R" is internally spanned} occurs. 

To see the subtlety of Proposition |3T], note that we cannot in general take the 
two rectangles R', R" to be disjoint; in Figure |7| for example, the whole square 
R is internally spanned, but the only possible choice for R', R" is the pair 
of 6 by 6 squares indicated. It is for this reason that the concept of disjoint 
occurrence is important. The idea of the proof of Proposition |3l] is simple: 
we run an algorithm which produces successively larger internally spanned 
rectangles, combining two rectangles into one at each step and eventually 
obtaining R; then we consider the last step. The details of the proof require 
a little more care. 



Proof of Proposition [31]. If K is any finite set of sites, we may con- 



struct (K) via the following algorithm. For each time step t — 0, 1, . . . , r, we 
shall construct a collection of m t rectangles R[, . . . , R^, and corresponding 
sets of sites K\, . . . , K % , with the following properties: 

(i) K{, . . . , K l mt are pairwise disjoint; 

(ii) K\ C K] 

(iii) R\ = (Kf) ; 

(iv) if i j then R\ % i& 
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(v) K CH f C(K), where 



i=i 

Initially, the rectangles and sets of sites are just the individual sites of 
K. That is, let K be enumerated as K = {x±, . . . , and set m Q = k and 
= = {xi}, so that in particular 

H° = K. 

The final set of rectangles R[, . . . R T rilT will have the property that 

H T = (K). (32) 

Before describing the algorithm we make the following observation. Let 
Ri,R 2 be two distinct rectangles neither of which is a subset of the other, 
and consider (R 1 U R 2 ). The following three possibilities exist. 

(a) U R 2 ) = R\ U i?2, and R\ U R 2 is not connected. 

(b) (Ri U R 2 ) — Ri U R 2 , and Ri U i? 2 is a rectangle. 

(c) (Ri U R 2 ) D R\ U i? 2 , and /?! U i? 2 is a rectangle. 

(As usual, a set of sites is said to be connected if any two sites can be joined 
via a sequence of sites at Euclidean distances 1). 

The algorithm proceeds as follows. Suppose R\, . . . , R l mt and K{, . . . , 
have already been constructed. 

Step (I). If there do not exist a pair of rectangles Rj, i?*- with i 7^ j such 
that (Rj U it!*-) is a rectangle (that is, if case (a) above holds for all 
pairs), then STOP, and set r = t. 

Step (II). If there do exist R\, i?*- with i ^ j such that (i?-Ui?*) is a rect- 
angle (case (b) or (c) above), then choose one such pair of rectangles. 
Denote the rectangle {R\ U i?*) by R'. Also let K' — K\ U JTj. 

Step (III). Construct the state (R\ +1 , K[ +1 ), {R^ +v K%£ +1 ) at time 
t + 1 as follows. From the list (i?J, K{), . . . , (i?^ t , -f^^J at time t, delete 
every pair (Rj, Kj) for which R\ C i?'. This includes (i?*, and 
(R^Kj), and may include others. Then add (R',K') to the list. 



28 



Step (IV). Increase t by 1 and return to Step (I). 

It is straightforward to see that properties (i)-(v) are preserved by this 
procedure. Also m t is strictly decreasing with t, so the algorithm must stop 
eventually. To check that fl32|) is satisfied, observe that if there exists a site 
x G (K) \ H T , then there must exist a site y G (K) \ H T having at least 
two neighbours in H T , but these neighbours must lie in distinct rectangles 
Ri,Kj, and hence the algorithm should not have stopped at time r. 

Furthermore, observe that if (K) is a single rectangle R, then we must 
have m T = 1 and R[ = R. If not, since H T = R, there must exist two distinct 
rectangles RJ, i?J whose union is connected, and again this means that the 
algorithm should not have stopped. 

Finally, to prove the proposition, note that if R is internally spanned then 
running the algorithm on the set of sites K = R R X results in m T = 1 and 
R[ = R. If \R\ > 2 then there must have been at least one step, r > 1. Now 
considering the last time step of the algorithm (from time r — 1 to time r) 
we obtain the two rectangles R' = -R[ _1 , R" = -RJ -1 with all the required 
properties. □ 



9 Hierarchies 

A directed graph is a set of vertices V together with a set of ordered pairs of 
vertices E. If (u, v) G E then we say there is an edge from u to v and write 
u ~» v. The children of a vertex u are all the vertices v such that u^-v. 

A hierarchy Ti is a finite directed graph in which every vertex v is labeled 
with a non-empty rectangle R v (where the rectangles corresponding to differ- 
ent vertices are not necessarily distinct), with the following properties. The 
graph is a tree. There is a special vertex r called the root, and all edges are 
directed away from the root. (So for any vertex v there is a unique directed 
chain of vertices r = w ^> u% ~> • • • u\. = v ). If u ~> v then we have the 
strict inclusion R u D R v . Every vertex has 0, 1 or 2 children. A vertex with 
no children is called a seed. If u has exactly one child v , we call u normal 
and write u v. If u has two children v, w we call u a splitter and write 
u (v, w). If u =1 io ) then we have (_R„ U iC) = See Figure [| for an 
example of a hierarchy. 
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Figure 8: An example of a hierarchy. 



We say that the hierarchy Ti occurs if all the following events occur dis- 
jointly: 

D(R V , R u ) for each pair u, v such that u =>• v, 

and 

{R w is internally spanned} for each seed w. 

(Note in particular that if for example R v = R w for two distinct seeds v, w 
then we require that the event {R v is internally spanned} occurs twice dis- 
jointly; that is, there are two disjoint sets of occupied sites each of which 
internally spans R v ). 

Fix q,T,Z satisfying < 4g < 2T < Z < 1/2 (in fact we shall be 
concerned with the case 0<g<T<Z<l). We call a hierarchy H good 
if it satisfies all the following. 

(i) If w is a seed then 

short (R w ) < 2Z/q- 

(ii) if u is normal or a splitter then 

short (R u ) > 2Z/q; 

(iii) if u =3- v and v is a seed or a normal vertex then 

<P(R u )- ( j ) (R v )e[T/(2q),T/q]; 
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(iv) if u => v and v is a splitter then 



4>(Ru) 



4>(Rv) < T/q; 



(v) if u =4 (f , if ) then 



and 



<f>{Ru) 
(f>(R u ) 



<P(R V ) > T/(2q) 
<!>{R W ) > T/(2q). 



Proposition 33 Let q,T,Z satisfy < 4g < 2T < Z < 1/2, and let R be 
a non-empty rectangle. If R is internally spanned then some good hierarchy 
with root-label R r = R occurs. 

Proof. The proof is by induction on the size of the rectangle. Let R be 
a rectangle and suppose the proposition holds for all rectangles with semi- 
perimeter less than <fi{R). 

If short (i?) < 2Z/q, then the good hierarchy having only one vertex r 
(which is the root and a seed), and R r = R, occurs. 

If short (i?) > 2Z/q then we construct a sequence of non-empty internally 
spanned rectangles R = S D S% D • • • D S m by an algorithm as follows. The 
idea is that Si, . . . , S m are successive attempts to find a rectangle S such that 
<j){R) —<p(S) is in the range [T/(2q),T/q]; the attempt may succeed in which 
case the root will be a normal vertex, or we may "overshoot" in which case we 
need to introduce a splitter. Here are the details. Given Si, apply Proposition 
[U] to obtain two rectangles S^, S". Let S i+ i be the one of S[, S" with the 
larger semi-perimeter <f> (choosing according to an arbitrary rule if they are 
equal). Stop, after m > 1 steps, the first time 4>(R) — <f>(S m ) > T/{2q). 
(This must occur eventually because the sequence of rectangles is strictly 
decreasing, and <f>(R) > ^Zjq > T / (2q) + 2, so that a rectangle S containing 
only one site does satisfy (f)(R) — (fi(S) > T/{2q); and we must have m > 1 
because T/(2q) > 0). 

Now consider the following three possible cases. 



(i) If 0(R)-0(S m )< T/q, 

then we have that <p(R) — 4>(S m ) G \T / {2q),T / q], S m C R, and R and 



S m are internally spanned. By the inductive hypothesis, there exists a 
good hierarchy Ti! with root r' and root-label R r > = S m . Furthermore, 
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the events D(S m ,R) and {TC occurs} occur disjointly, since they are 
defined in terms of disjoint sets of sites. We construct a hierarchy TC as 
follows: start with TC, and add a new vertex r, with R r = R, and an 
edge from r to r' so that r =>■ r'. It follows from the above observations 
that TC is good and occurs. 

ii) If <f)(R) - <p(R m ) > T/q and m = 1, 

then there exist disjointly internally spanned rectangles S' , S' ' such 
that (S'q U S%) = R, and <j>{R) - (f>(S' ), <f>(R) - </>(S$) > T/q > T/(2q). 
By Proposition [31] and the inductive hypothesis there exist disjointly 
occuring good hierarchies TC, TC" with roots r', r" and root-labels R r > = 
S' , R r " = Sq, and we may take the vertex sets of TC', TC" to be disjoint. 
Now we construct TC from TC' U TC" (this last object being a labeled 
directed graph defined in the obvious way) by adding a new vertex r, 
with R r = R, and edges so that r =4 (r', r"). It is easily seen that TC is 
a good hierarchy and occurs. 

iii) If (f>(R) - <f){R m ) > T/q and m > 2, 

then we have internally spanned rectangles S m -i, <S^_i, •S'm-i satisfying 
i2 D S m _i and 5 m _i = (5'^_ 1 U S'^), where - ^(S^-i) < 

T/(2g) and - (piS^) , (f>(R) - 0(S'*_ 1 ) > T/g, and therefore we 
have (f>(S m -i) <f>(S m -i) -<t>{S'm-i) >T/(2q). By Proposition 
5T] and the inductive hypothesis there exist disjointly occuring good 



hierarchies TC',TC" with roots r',r" and root-labels _R r / = S' m _ x ,R r n = 
S^_i, and we may take the vertex sets of TC ,TC" to be disjoint. Now 
we construct 7^ from TC U W by adding new vertices r, y with R r = 
R, R y = S m -i, and new edges so that r =^> y and y =4 (r', r"). Then it 
is easily seen that TC is a good hierarchy and occurs. 

□ 



10 Upper Bound 

We are now ready to prove Theorem |2| (ii). Fix B > 2, and let A = c = 1/B. 
We shall prove that 

hmmf-plog I ([B/p\,p) > 2(1 - 2c) / g(z) dz, 

P~*° J A 
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from which Theorem |2] (ii) follows. The approach is to use Proposition 
and obtain upper bounds on the number of possible good hierarchies, and on 
the probability that each one occurs. 

Choose Z > sufficiently small that Z < A/2 and 

9(2Z) > ^ (34) 

where A = n 2 /18. Recall that g(z) — > oo as z — > 0, so this is indeed possible; 
the reason for this particular choice of Z will become clear later. Finally 



choose T = T(c, Z),Q = Q(c, Z) according to Proposition It will also be 
convenient to assume that q < T/2. Thus we have 

16q < 8T < 4Z < 2A < 1 < B/2. 

We shall be concerned with "large" B and "very small" q, in which case we 
have 

Later we will let q —>■ while keeping B fixed. It will be important to 
distinguish between quantities which depend only on B (such as A, c, Z, T, Q) 
and those which also depend on q. 
If R is a rectangle we define 



V(R) = qhng(R) g[qshoTt(R) 

and for rectangles R C R' we define 

U(R,R') = w(qdim(R),qdim(R') 

where W is defined as in Section |6|. We claim that for any R we have 

P P (R is internally spanned) < exp — g~ 1 K(i?), (35) 

while for any R C R' such that short(i?') > 2Z/q and - <f>(R) < T/q 

we have 

P p (D(R,R')) < Qexp-q- l (l-2c)U{R,R'). (36) 

Inequality ([35]) follows from Lemma || (ii) and Lemma |9] (i) (using which- 
ever of East- and North- traversability gives the better bound). For (|36[). 
note that dim(i?') - dim(i?) < (T/q,T/q), and short(i?) > 2Z/q - T/q > 
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Z/q, so R, R' satisfy the conditions of Proposition |22|. Combining this with 
Proposition [13] and the definition of U above yields (|36|). 

Now if T~L is any good hierarchy, by BK inequality and the definition of 
H occuring, we have 

PpiH occurs) 



< 



Y\ Ppi D(R V , R u ) ) [ [ P P (R w internally spanned) 



seed 

l^norm ^ 1 



< Q iVnorm exp -q 



seed 



, (37) 



where the first product and sum are over all pairs of vertices u, v such that 
it =r- v, the second product and sum are over all seeds w, and N QOTm is the 
number of normal vertices of 7i. The second inequality in ([37|) follows from 
(|3lf),(j36l) above and from properties (ii),(iii),(iv) of a good hierarchy. 
Next we derive lower bounds for the two sums in (p^). 

Lemma 38 For any good hierarchy 7i with root-label R r = R, there exists a 
rectangle S = S(H) C R satisfying 



dim(S) < dim(^) 



w seed 

such that 

U{R V , R u ) > U(S, R) - N spht 2qg(Z), 
where N sp m is the number of splitters ofTi. 

We call S as above the pod of 7i. The idea is that, if the total size of the 
seeds is not too big, then the pod is not too big, and therefore the first sum 
in (|37|) is large enough to give a good bound. Note that the location of the 
pod is actually immaterial - only its dimensions are ever used. It is defined 
to be a rectangle rather than just a 2- vector as a notational convenience only. 
The pod is function of the hierarchy 7i only, not of B or q (even though the 
definitions of a good hierarchy and of U(-, •) do depend on B and q). 
The following will be needed in the proof of Lemma [38| . 

Lemma 39 // (R' U R") = R then dim(R') + dim( J R") > dim(i?) - (1, 1). 



34 



Proof. This is proved in a similar manner to Lemma |^ (i). If the sum 
of the widths of R', R" is less than the width of R minus one, then either R 
has two adjacent columns which do not intersect R' U R", or the East-most 
or West-most column of R does not intersect R' U R". In either case, no site 
in such a column can be in (R' U R") . □ 



Proof of lemma |38|. The proof is by induction on the number of vertices 
of Ti. Suppose the lemma holds for all hierarchies with fewer vertices than 
TC. We consider three cases according to whether the root r is a seed, normal, 
or a splitter. 

If r is a seed (so it is the only vertex), then we take S = R, and the result 
holds trivially. 

If r is normal, so that r =>- y say, then we apply the inductive hypothesis 
to the sub-hierarchy H! rooted at y (that is, the hierarchy obtained by taking 
all vertices and edges in directed chains y — Vq ~» V\ ~> v 2 ~» • • • away from 
y, together with the associated rectangles). Let S = S(TC) = S(TC'), and 
note that Ji! has the same number of splitters as Ti, to obtain 

J2U(R V ,R U ) > U(R y ,R) + U(S,R y )-N sm 2qg(Z) 

u=>v 

> U{S,R)-N split 2qg{Z), 

by Proposition |T2| and the definition of U(-, ■). 

If r is a splitter, so that r ^ (y±, y-i) say, we apply the inductive hypothesis 
to the sub-hierarchies TCi,Ti,2 rooted at 3/1,3/2, and denote their pods S\ = 
S(Hi),S2 = 5(7^2)- We also write R\ = R Vl ,R2 = R V2 . Since the total 
number of splitters in Hi and TL 2 is one less than N sp \ it , we obtain 

^ U(R V , R u ) > U(S 1 , + U(S 2 , R 2 ) - (N spm - l)2qg(Z). (40) 



Now we apply Proposition 15 with 



a = gdim(S'i), 

b = gdim(i?i). 

c = qdim(S 2 ), 

d = gdim(i?2)- 

r = qdim(R). 
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We choose the pod S = S(TC) to be a rectangle satisfying SCR and 

s = q dim(S'), 



where s is as in Proposition [15]. The formula for s ensures that the dimensions 
of S are indeed integers and that S depends only on TC. Furthermore, since 
s < a + c we have 

dim(S) < dim(Si) + dim(S 2 ) < ^ dim(iC), 

w seed 

by the inductive hypothesis, since the set of seeds of Tt is the disjoint union 
of the sets of seeds of Ti 1 and Ti" . It is easy to check that the conditions of 
Proposition [15] are met, by Lemma |39] and property (ii) of a good hierarchy, 
so we obtain 

U(3 h Ri) + U(S 2 , R 2 ) > U(S, R) - 2qg(Z). 

Combining this with (HO) gives the required bound. □ 

Now we derive a lower bound on the second sum in (^). If w is a seed 
then 

V{Rw) = long(fi w ) g(q short (JQ) > g(2Z) 
q4>(R w ) \ong(R w ) + short(i? U) ) ~~ 2 

by property (i) of a good hierarchy. Hence 



> 



9&Z) 



(41) 



w seed 



seed 



since the pod S satisfies dim(5*) < J2 W seed dim(R w ) by Lemma [38|. 

Substituting from Lemma |3S] and (O) into ([J7|), for any good hierarchy 
Ti. we have 



P P (H occurs) < 

QN Borm QN sp m exp _ g -l 



'l-2c)U(S,R) + ^p-q<P(S) 



(42) 



where Qi — 

Now suppose that 7-^ is a good hierarchy with root-label R r = R = 
R(\B/q], \B/q~\). Let us find an upper bound on the number of vertices 
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of 7i. By properties (iii),(v) of a good hierarchy, in any directed chain of 
vertices r = Vq ~> vi ~> • • • ~> away from the root, at least half the edges 
have (f) decreasing by at least T/(2q), so the number of vertices in such a 
chain is at most 

2\B/q] 20B 

2 ' / q + 1 < . 

T/(2q) ~ T 

Hence, since the graph underlying 7i is a binary tree, the total number of 
vertices in 7i is at most 

M := 2 2QB ' T . 

All that matters is that this number depends only on B, not on q. 

We now divide hierarchies into two different types according to the semi- 
perimeter of the pod. If q<f){S) < A then by Lemma [H] we have 



rq\B/q] rB 

U(S,R)>2 g{z)dz>2 g(z) dz. 

Jq<h(S) J A 



-q\B/q] 

On the other hand, if q<f){S) > A then by the choice of Z, (P5), we have that 



9{2Z) q<P(S) > > 2A > 2(1 - 2c) [ B g(z) dz, 



A 



2 1TX ' ~ 2 
since A = / °° g(z) dz. In both cases we obtain from (|2|) that 



P P {H occurs) < Qf exp -g _1 2(l - 2c) / g(z) dz, (43) 

J A 

where Q 2 = ma.x{Q,Qi}. 

We now bound the total number of possible good hierarchies with root- 
label R. The number of abstract directed graphs with at most M vertices 
is at most M2 M , and the number of different rectangles in R is at most 
(B/q + l) 4 , so the number of possible good hierarchies is at most 

M2 M \2B/q) AM . (44) 

From Proposition [33] and (^), (|44]) we deduce the existence of constants 
K\,K 2 G (0, oo) depending only on B, such that when q is sufficiently small, 

P P (R is internally spanned) < K\q~ K2 exp — q~ x 2\s, 
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where 

fB 



X B = (1 - 2c) J g(z) dz. (45) 
Hence, recalling that q ~ p, we have 



This implies that 



lim inf — p log /( [5 /gl , p) > 2Xb- 



lim inf -plog I([B /p\,p) >2X B . (46) 



To check this note that for any e > we may write \_B/p\ = \(B + e)/q'~\, 
where q' — — log(l — p') and p' ~ p(B + e)/B as p — > 0; since I(L,p) is 
increasing in p this implies that the left side of (HBf) is at least 2Xb+ e B /(B+e), 
establishing fl46|). Finally, since y4 = c= l/-Bwe have from (^) that 

/■oo 

Xb — > I g(z) dz = X as B — ► oo. 

JO 

Hence we have proved Theorem |2| (ii). 



Acknowledgments 

I thank Marek Biskup, Lincoln Chayes, Nathaniel Grossman, Tom Liggett 
and Roberto Schonmann for valuable conversations. 



References 

[1] J. Adler. Bootstrap percolation. Physica A, 171:453-470, 1991. 

[2] J. Adler, D. Stauffer, and A. Aharony. Comparison of bootstrap perco- 
lation models. Journal of Physics A, 22:L297-L301, 1989. 

[3] M. Aizenman and J. L. Lebowitz. Metastability effects in bootstrap 
percolation. J. Phys. A, 21(19):3801-3813, 1988. 

[4] E. D. Andjel, T. S. Mountford, and R. H. Schonmann. Equivalence of 
exponential decay rates for bootstrap-percolation-like cellular automata. 
Ann. Inst. H. Poincare Probab. Statist., 31(1): 13-25, 1995. 



38 



[5] R. Cerf and E. N. M. Cirillo. Finite size scaling in three-dimensional 
bootstrap percolation. Ann. Probab., 27(4):1837-1850, 1999. 

[6] R. Cerf and F. Manzo. The threshold regime of finite volume bootstrap 
percolation. To appear. 

[7] L. R. Fontes, V. Sidoravicius, and R. H. Schonmann. Stretched expo- 
nential fixation in stochastic ising models at zero temperature. Commu- 
nications in Mathematical Physics. To appear. 

[8] K. Frobose. Finite-size effects in a cellular automaton for diffusion. J. 
Statist. Phys., 55(5-6): 1285-1292, 1989. 

[9] I. S. Gradshteyn and 1. M. Ryzhik. Table of integrals, series, and prod- 
ucts. Academic Press, New York, 1965. 

[10] J. Gravner and D. Griffeath. Scaling laws for a class of critical cellular 
automaton growth rules. In Random walks (Budapest, 1998), pages 167- 
186. Janos Bolyai Math. Soc, Budapest, 1999. 

[11] G. R. Grimmett. Percolation. Springer- Verlag, second edition, 1999. 

[12] S. Kirkpatrick, W. Wilcke, R. Garner, and H. Huels. Percolation in 
dense storage arrays. Physica A. To appear. 

[13] R. H. Schonmann. Finite size scaling behavior of a biased majority rule 
cellular automaton. Phys. A, 167(3) :619-627, 1990. 

[14] R. H. Schonmann. On the behavior of some cellular automata related 
to bootstrap percolation. Ann. Probab., 20(1):174-193, 1992. 

[15] R. H. Schonmann. Slow droplet-driven relaxation of stochastic Ising 
models in the vicinity of the phase coexistence region. Comm. Math. 
Phys., 161(l):l-49, 1994. 

[16] R. H. Schonmann and S. B. Shlosman. Wulff droplets and the metastable 
relaxation of kinetic Ising models. Comm. Math. Phys., 194(2):389-462, 
1998. 

[17] A. C. D. van Enter, J. Adler, and J. A. M. S. Duarte. Finite-size effects 
for some bootstrap percolation models. J. Statist. Phys., 60(3-4):323- 
332, 1990. 



39 



[18] A. C. D. van Enter, J. Adler, and J. A. M. S. Duarte. Addendum: 
"Finite size effects for some bootstrap percolation models". J. Statist. 
Phys., 62(l-2):505-506, 1991. 

[19] S. Wolfram, editor. Theory and applications of cellular automata. World 
Scientific Publishing Co., Singapore, 1986. Including selected papers 
1983-1986. 

Alexander E. Holroyd 
UCLA Department of Mathematics 
405 Hilgard Avenue 
Los Angeles 
CA 90095-1555 
U. S. A. 

holroyd@math . ucla . edu 



40 



